Creating Domain-Specific Sentiment Lexicons via Text Mining
Abstract
Sentiment analysis aims to identify and categorize customers' opinions and judgments using either traditional supervised learning techniques or unsupervised approaches. Traditionally, sentiment analysis is performed using machine learning techniques such as naive Bayes classification or support vector machines (SVM), or it can make use of a sentiment lexicon, that is, a list of words that are mapped to sentiment scores. Our work focuses on generating a domain-specific lexicon using probabilities and information-theoretic techniques. By employing text mining, we overcome the poor performance of transferred supervised machine learning techniques and remove the need to adapt an existing lexicon while maintaining accuracy. We show that text mining techniques perform as well as traditional approaches, and we demonstrate that domain-specific lexicons perform better than general lexicons in a sentiment analysis task. We further review and compare the generated lexicons.
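The abstract does not say which probabilistic or information-theoretic measures are used, so the following is only a minimal sketch of one common choice: scoring each word by the difference of its pointwise mutual information (PMI) with the positive and the negative class, which reduces to a smoothed log-ratio of class-conditional document frequencies. The function name `build_lexicon`, the `min_count` threshold, the whitespace tokenizer, and the toy reviews are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def build_lexicon(docs, min_count=2):
    """Score words from (text, label) pairs, label in {"pos", "neg"}.

    score(w) = PMI(w, pos) - PMI(w, neg) = log2(P(w | pos) / P(w | neg)),
    estimated from document frequencies with add-one smoothing.
    This is an illustrative choice, not the paper's confirmed method.
    """
    doc_freq = {"pos": Counter(), "neg": Counter()}
    n_docs = Counter()
    for text, label in docs:
        n_docs[label] += 1
        # Count each word once per document (document frequency).
        for word in set(text.lower().split()):
            doc_freq[label][word] += 1

    vocab = set(doc_freq["pos"]) | set(doc_freq["neg"])
    lexicon = {}
    for word in vocab:
        pos_df = doc_freq["pos"][word]
        neg_df = doc_freq["neg"][word]
        if pos_df + neg_df < min_count:
            continue  # too rare to score reliably
        p_w_pos = (pos_df + 1) / (n_docs["pos"] + 2)
        p_w_neg = (neg_df + 1) / (n_docs["neg"] + 2)
        lexicon[word] = math.log2(p_w_pos / p_w_neg)
    return lexicon


# Made-up product reviews, purely for demonstration.
reviews = [
    ("great battery life", "pos"),
    ("great screen", "pos"),
    ("battery died fast", "neg"),
    ("died after a week", "neg"),
]
for word, score in sorted(build_lexicon(reviews).items()):
    print(f"{word}: {score:+.3f}")
```

On this toy corpus a word seen only in positive reviews gets a positive score, a word seen only in negative reviews gets a negative score, and a word spread evenly across both classes scores zero, which is the behavior a domain-specific lexicon needs to capture domain-dependent polarity.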
Similar resources
Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora
A word's sentiment depends on the domain in which it is used. Computational social science research thus requires sentiment lexicons that are specific to the domains being studied. We combine domain-specific word embeddings with a label propagation framework to induce accurate domain-specific sentiment lexicons using small sets of seed words. We show that our approach achieves state-of-the-art ...
Performance Investigation of Feature Selection Methods
Sentiment analysis, or opinion mining, has become an open research domain after the proliferation of the Internet and Web 2.0 social media. People express their attitudes and opinions on social media, including blogs, discussion forums, and tweets, and sentiment analysis is concerned with detecting and extracting sentiment or opinion from online text. Sentiment based text classification is different from t...
Rough Set Techniques for Text Classification and Sentiment Analysis in Social Media
Sentiment Analysis (SA) is an ongoing research area in the field of text mining and classification. SA finds a computational domain from opinions and subjectivity of text data in online social media. Sentiments are inherited in the form of simple lexicons with symbols and texts having noise of irregular texts in complex forms. It is also seen that the high dimensional growth of lexical blends used b...
Exploring Sentiment in Social Media: Bootstrapping Subjectivity Clues from Multilingual Twitter Streams
We study subjective language in social media and create Twitter-specific lexicons via bootstrapping sentiment-bearing terms from multilingual Twitter streams. Starting with a domain-independent, high-precision sentiment lexicon and a large pool of unlabeled data, we bootstrap Twitter-specific sentiment lexicons, using a small amount of labeled data to guide the process. Our experiments on Englis...
Domain-Based Lexicon Enhancement for Sentiment Analysis
General knowledge sentiment lexicons have the advantage of wider term coverage. However, such lexicons typically have inferior performance for sentiment classification compared to using domain focused lexicons or machine learning classifiers. Such poor performance can be attributed to the fact that some domain-specific sentiment-bearing terms may not be available from a general knowledge lexico...